Statistical distribution of amino acid sequences: a proof of Darwinian evolution
نویسندگان
چکیده
MOTIVATION The article presents results of the listing of the quantity of amino acids, dipeptides and tripeptides for all proteins available in the UNIPROT-TREMBL database and the listing for selected species and enzymes. UNIPROT-TREMBL contains protein sequences associated with computationally generated annotations and large-scale functional characterization. Due to the distinct metabolic pathways of amino acid syntheses and their physicochemical properties, the quantities of subpeptides in proteins vary. We have proved that the distribution of amino acids, dipeptides and tripeptides is statistical which confirms that the evolutionary biodiversity development model is subject to the theory of independent events. It seems interesting that certain short peptide combinations occur relatively rarely or even not at all. First, it confirms the Darwinian theory of evolution and second, it opens up opportunities for designing pharmaceuticals among rarely represented short peptide combinations. Furthermore, an innovative approach to the mass analysis of bioinformatic data is presented. CONTACT [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
منابع مشابه
Correlation between the substitution rate and rate variation among sites in protein evolution.
It is well known that the rate of amino acid substitution varies among different proteins and among different sites of a protein. It is, however, unclear whether the extent of rate variation among sites of a protein and the mean substitution rate of the protein are correlated. We used two approaches to analyze orthologous protein sequences of 51 nuclear genes of vertebrates and 13 mitochondrial...
متن کاملFasciola gigantica of Ruminants: The phylogenetic analysis based on COX1 sequences
BACKGROUND: Fasciola species are parasitic trematode with world wide distribution that infects wild and domesticated herbivores, particularly ruminants. The aim of the present study was to investigate the intra species variations of F. gigantica, from goats and buffalos isolates in two common geographic climates of Iran. METHODS: Fasciola species were collected from goat, buffalo, sheep, and ca...
متن کاملBioinformatics study of complete amino acid sequences of neuraminidase (NA) antigen of H1N1 influenza viruses from 2006 to 2013 in Iran
Introduction: Influenza is a contagious acute viral disease of the respiratory tract that causes fever, headache, muscle aches and cough. One of the unique features of influenza virus is antigenic variation in viral protein neuraminidase (NA) which causes emergence of new virus variants. NA is responsible for the release and spread of progeny virions. Due to the continuous changes of NA genes, ...
متن کاملStatistical Aspects of the Non-darwinian Theory
Quite apart from its possible biological relevance, the non-Darwinian theory of evolution currently under discussion is of considerable interest to statisticians. This is so, of course, because any mathematical formulation of the proposition that a considerable proportion of observed allelic substitutions have no selective significance and have occurred purely by chance, and, hence, any quantit...
متن کاملPhylogenetic and sequence analysis of the growth hormone gene of two sturgeons, Huso huso and Acipenser Gueldenstaedtii
In this study, the cDNA Growth Hormone (cGH) of the Belugasturgeon (Husohuso) and Russian sturgeon (Acipensergueldenstaedtii) were cloned and sequenced, and phylogenetic relationships were examined using nucleic acid and amino acid sequences. The nucleotide sequence of the Beluga GH has an open reading frame of 645 nucleotides encoding a protein 214 amino acid residues. The signal peptide cleav...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 26 23 شماره
صفحات -
تاریخ انتشار 2010